Add database persistence benchmarks by benthecarman · Pull Request #915 · lightningdevkit/ldk-node

benthecarman · 2026-05-31T23:06:09Z

built on top of #919
Closes #908

Benchmark realistic payment and pending-payment persistence workloads
across filesystem, SQLite, and optional PostgreSQL stores. Use the async
KV-store APIs so the measured paths match the database interfaces used by
async persistence.

Also adds support for different db backends to the current payment benchmark.

ldk-reviews-bot · 2026-05-31T23:06:12Z

👋 Hi! I see this is a draft PR.
I'll wait to assign reviewers until you mark it as ready for review.
Just convert it out of draft status when you're ready for review!

Use a temporary rust-lightning fork revision that exposes async migratable KV-store support. This lets ldk-node migrate filesystem stores without reimplementing LDK's migration logic locally. Co-Authored-By: HAL 9000

In lightningdevkit/vss-server#101 we dropped the Java version of `vss-server` and moved the Rust version to the repo root. Here we account for the updated location in our integration tests.

Read and write BDK wallet state through async KVStore helpers while keeping the current WalletPersister entry points bridged through the node runtime. This reduces the wallet persistence surface that still depends on KVStoreSync. Co-Authored-By: HAL 9000

Static invoice persistence already runs from async handlers, so use KVStore directly instead of routing those reads and writes through the blocking KVStoreSync trait. Co-Authored-By: HAL 9000

tnull · 2026-06-03T12:32:12Z

Feel free to rebase on top of #919 and let me know if you see any differences in the benchmarks.

Persist peer store updates through async KVStore operations. The synchronous node APIs keep bridging at their runtime boundary while async event handling awaits peer persistence directly. Co-Authored-By: HAL 9000

Persist DataStore mutations through async KVStore operations while keeping the existing synchronous APIs bridged through the node runtime. Async event handling now awaits payment store writes directly. Co-Authored-By: HAL 9000

Persist node metric updates through async KVStore writes and await them from the chain, gossip, and scoring tasks. This removes the remaining blocking metrics writer while keeping the helper name stable. Co-Authored-By: HAL 9000

Avoid introducing a temporary macro when moving node metrics persistence onto async KV storage. Co-Authored-By: HAL 9000

Persist the on-chain wallet through BDK's AsyncWalletPersister so wallet state writes use the async KVStore path. Existing synchronous wallet APIs keep bridging through the node runtime until their callers are made async. Co-Authored-By: HAL 9000

Open filesystem stores through the async LDK migration helper so v1-to-v2 store migration no longer depends on the blocking KVStoreSync migration path. Co-Authored-By: HAL 9000

Move the existing in-memory test store into a shared module without changing its behavior. This lets integration tests reuse it while keeping the later async TestSyncStore change separate. Co-Authored-By: HAL 9000

Keep the shared test store move as a pure code move by restoring the original comments and spacing. Co-Authored-By: HAL 9000

Exercise async KVStore operations in TestSyncStore and filesystem migration tests while keeping the temporary sync comparison path until the final KVStoreSync removal. Also route pathfinding score export through async KVStore reads. Co-Authored-By: HAL 9000

Drop the remaining synchronous KV store trait bounds and implementations. After the preceding migrations, custom stores only need to provide async KVStore persistence. Co-Authored-By: HAL 9000

Compile the VSS persistence tests after the shared KV store helper moved to async persistence. Co-Authored-By: HAL 9000

Resolve the VSS schema through async setup on the caller runtime so VSS stores no longer create or retain a Tokio runtime. NodeBuilder preserves fail-fast setup while the VSS builder stays synchronous. Co-Authored-By: HAL 9000

Construct PostgreSQL storage on the caller runtime now that the store only exposes async KVStore operations. This removes the store-owned runtime and its shutdown path. Co-Authored-By: HAL 9000

benthecarman · 2026-06-03T16:29:02Z

codex summary:

Benchmark Results

Payment Benchmarks

Backend	Previous	Current	Change
filesystem	6.2475 s	6.2138 s	slightly faster
sqlite	6.7378 s	6.6388 s	faster
postgres	17.669 s	17.489 s	slightly faster; Criterion reported no significant change

Lower is better. Delta compares the async-branch rerun against the previous saved Criterion results.

Bench	Backend	Previous	Async rerun	Delta
single write new	filesystem	1.844 ms	1.847 ms	+0.2%
single write new	sqlite	2.673 ms	2.658 ms	-0.6%
single write new	postgres	636.0 us	630.1 us	-0.9%
single write existing	filesystem	1.860 ms	1.881 ms	+1.1%
single write existing	sqlite	22.1 us	22.5 us	+2.0%
single write existing	postgres	638.1 us	624.9 us	-2.1%
single read	filesystem	14.7 us	15.1 us	+2.2%
single read	sqlite	17.0 us	17.3 us	+1.7%
single read	postgres	145.0 us	146.2 us	+0.8%
single remove	filesystem	908.8 us	920.0 us	+1.2%
single remove	sqlite	2.842 ms	2.728 ms	-4.0%
single remove	postgres	1.445 ms	1.470 ms	+1.8%
warm insert 100	filesystem	185.626 ms	185.919 ms	+0.2%
warm insert 100	sqlite	276.593 ms	269.530 ms	-2.6%
warm insert 100	postgres	63.417 ms	62.791 ms	-1.0%
concurrent distinct 100	filesystem	10.964 ms	10.909 ms	-0.5%
concurrent distinct 100	sqlite	272.638 ms	261.112 ms	-4.2%
concurrent distinct 100	postgres	9.177 ms	9.230 ms	+0.6%
concurrent same-key 100	filesystem	11.495 ms	13.050 ms	+13.5%
concurrent same-key 100	sqlite	249.663 ms	191.029 ms	-23.5%
concurrent same-key 100	postgres	58.739 ms	56.400 ms	-4.0%
cold insert 100	filesystem	185.829 ms	183.843 ms	-1.1%
cold insert 100	sqlite	266.497 ms	307.448 ms	+15.4%
cold insert 100	postgres	74.440 ms	72.031 ms	-3.2%
cold update 100	filesystem	188.561 ms	186.808 ms	-0.9%
cold update 100	sqlite	247.062 ms	246.978 ms	-0.0%
cold update 100	postgres	78.208 ms	74.185 ms	-5.1%
reload 100	filesystem	2.995 ms	3.002 ms	+0.2%
reload 100	sqlite	3.532 ms	3.499 ms	-0.9%
reload 100	postgres	3.237 ms	3.152 ms	-2.7%
first page from 10k	filesystem	25.290 ms	25.090 ms	-0.8%
first page from 10k	sqlite	29.8 us	24.8 us	-17.0%
first page from 10k	postgres	184.5 us	182.3 us	-1.2%
second page from 10k	filesystem	50.665 ms	50.422 ms	-0.5%
second page from 10k	sqlite	66.5 us	50.8 us	-23.7%
second page from 10k	postgres	376.7 us	372.7 us	-1.1%
lifecycle insert-update-read	filesystem	6.786 ms	3.736 ms	-44.9%
lifecycle insert-update-read	sqlite	9.866 ms	5.216 ms	-47.1%
lifecycle insert-update-read	postgres	2.533 ms	1.617 ms	-36.2%
pending insert 100	filesystem	273.515 ms	183.380 ms	-33.0%
pending insert 100	sqlite	333.859 ms	262.138 ms	-21.5%
pending insert 100	postgres	75.244 ms	75.999 ms	+1.0%
pending update 100	filesystem	197.544 ms	187.656 ms	-5.0%
pending update 100	sqlite	257.055 ms	264.137 ms	+2.8%
pending update 100	postgres	75.756 ms	63.715 ms	-15.9%

Takeaways

The async branch materially improves the lifecycle workload across all stores: filesystem improved by 44.9%, sqlite by 47.1%, and Postgres by 36.2%.
Postgres pending-payment updates improved meaningfully, dropping from 75.756 ms to 63.715 ms, a 15.9% reduction.
Most single-operation Postgres results are roughly unchanged, with small improvements on writes and small regressions/noise on reads/removes.
Postgres cold payment-store writes improved modestly: inserts improved by 3.2% and updates by 5.1%.
SQLite saw large improvements in paginated reads and same-key concurrent writes, but cold insert regressed by 15.4%, so that path is worth a closer look.
Payment benchmarks are still not trustworthy for comparison because the workload does not complete cleanly and times out waiting for HTLC slots.

joostjager · 2026-06-04T08:52:46Z

It would have been nice if async persistence would show a clear performance win here. The store-level smoke tests are useful, but the closest thing to real node load seems to be the payment benchmark, and that does not show a meaningful improvement, putting aside that it is marked as not reliable yet.

Does this tell us much about high-load node performance in practice? If not, what would be the next useful benchmark?

tnull · 2026-06-04T08:56:05Z

It would have been nice if async persistence would show a clear performance win here. The store-level smoke tests are useful, but the closest thing to real node load seems to be the payment benchmark, and that does not show a meaningful improvement, putting aside that it is marked as not reliable yet.

Hmm, note that 'async persistence' means pre and post #919 AFAIU? I.e., it does not refer to LDK's async persistence itself, which we switched to pre-#919.

benthecarman · 2026-06-04T16:12:19Z

Hmm, note that 'async persistence' means pre and post #919 AFAIU? I.e., it does not refer to LDK's async persistence itself, which we switched to pre-#919.

yeah that is pre/post 919

benthecarman · 2026-06-04T16:27:28Z

Does this tell us much about high-load node performance in practice? If not, what would be the next useful benchmark?

Added a few more with start up time, forwarding payments, and channel opens. But working on making these more robust

Benchmark realistic payment and pending-payment persistence workloads across filesystem, SQLite, and optional PostgreSQL stores. Use the async KV-store APIs so the measured paths match the database interfaces used by async persistence. AI-Assisted: Codex

Run the existing payments benchmark once per configured store backend so filesystem, SQLite, and optional PostgreSQL results are reported under the same payment flow. AI-assisted-by: OpenAI Codex

Wait for the cleanup payment to settle before starting the next payment benchmark sample. This keeps the measured forward-payment duration intact while avoiding HTLC state leaking into later samples. AI tools: Created with assistance from OpenAI Codex.

Add an operations bench target with a forwarding benchmark that compares sqlite, filesystem, and postgres stores over a settled multi-hop payment. AI-assisted-by: OpenAI Codex

Add a channel-open benchmark that measures the open_channel call while leaving chain confirmation cleanup outside the timed section. AI-assisted-by: OpenAI Codex

Add a startup benchmark that restarts a node whose store already contains channel and payment data, so startup cost reflects persisted node state. AI-assisted-by: OpenAI Codex

AI-Assisted-By: OpenAI Codex

benthecarman mentioned this pull request May 31, 2026

Add Postgres database #863

Merged

benthecarman force-pushed the db-bench branch 3 times, most recently from 5f75dd9 to bc97c3a Compare June 2, 2026 22:19

tnull added 4 commits June 3, 2026 14:22

DROPME: Bump LDK for async store migration

c23e105

Use a temporary rust-lightning fork revision that exposes async migratable KV-store support. This lets ldk-node migrate filesystem stores without reimplementing LDK's migration logic locally. Co-Authored-By: HAL 9000

DROPME: Account for vss-server location change

cb67f6c

In lightningdevkit/vss-server#101 we dropped the Java version of `vss-server` and moved the Rust version to the repo root. Here we account for the updated location in our integration tests.

Use async KV storage for static invoices

b9217a6

Static invoice persistence already runs from async handlers, so use KVStore directly instead of routing those reads and writes through the blocking KVStoreSync trait. Co-Authored-By: HAL 9000

tnull added 14 commits June 3, 2026 16:42

f Simplify

0d6c350

Move peer persistence onto async KV storage

ffceaca

Persist peer store updates through async KVStore operations. The synchronous node APIs keep bridging at their runtime boundary while async event handling awaits peer persistence directly. Co-Authored-By: HAL 9000

Move DataStore persistence onto async KV storage

90e8e59

Persist DataStore mutations through async KVStore operations while keeping the existing synchronous APIs bridged through the node runtime. Async event handling now awaits payment store writes directly. Co-Authored-By: HAL 9000

Move node metrics persistence onto async KV storage

333e9a4

Persist node metric updates through async KVStore writes and await them from the chain, gossip, and scoring tasks. This removes the remaining blocking metrics writer while keeping the helper name stable. Co-Authored-By: HAL 9000

f - Move on-chain wallet update helper out of macro

8d7d47b

Avoid introducing a temporary macro when moving node metrics persistence onto async KV storage. Co-Authored-By: HAL 9000

Use BDK's async wallet persister

b4cf32f

Persist the on-chain wallet through BDK's AsyncWalletPersister so wallet state writes use the async KVStore path. Existing synchronous wallet APIs keep bridging through the node runtime until their callers are made async. Co-Authored-By: HAL 9000

Use async KVStore migration for filesystem stores

47f80ba

Open filesystem stores through the async LDK migration helper so v1-to-v2 store migration no longer depends on the blocking KVStoreSync migration path. Co-Authored-By: HAL 9000

Move test InMemoryStore into shared module

867b0f9

Move the existing in-memory test store into a shared module without changing its behavior. This lets integration tests reuse it while keeping the later async TestSyncStore change separate. Co-Authored-By: HAL 9000

f - Preserve moved InMemoryStore code exactly

1843e38

Keep the shared test store move as a pure code move by restoring the original comments and spacing. Co-Authored-By: HAL 9000

Remove blocking KV store support

40e4aad

Drop the remaining synchronous KV store trait bounds and implementations. After the preceding migrations, custom stores only need to provide async KVStore persistence. Co-Authored-By: HAL 9000

f - Await async VSS store test helper

0ba01de

Compile the VSS persistence tests after the shared KV store helper moved to async persistence. Co-Authored-By: HAL 9000

Stop retaining a VSS runtime

3e76ccc

Resolve the VSS schema through async setup on the caller runtime so VSS stores no longer create or retain a Tokio runtime. NodeBuilder preserves fail-fast setup while the VSS builder stays synchronous. Co-Authored-By: HAL 9000

Stop retaining a PostgreSQL runtime

9ad8f2d

Construct PostgreSQL storage on the caller runtime now that the store only exposes async KVStore operations. This removes the store-owned runtime and its shutdown path. Co-Authored-By: HAL 9000

benthecarman force-pushed the db-bench branch from bc97c3a to fbc6fb7 Compare June 3, 2026 16:41

benthecarman force-pushed the db-bench branch from fbc6fb7 to c721923 Compare June 4, 2026 16:25

benthecarman added 7 commits June 4, 2026 20:47

Benchmark payments across stores

ef8694f

Run the existing payments benchmark once per configured store backend so filesystem, SQLite, and optional PostgreSQL results are reported under the same payment flow. AI-assisted-by: OpenAI Codex

Add forwarding operations benchmark

8e9fa64

Add an operations bench target with a forwarding benchmark that compares sqlite, filesystem, and postgres stores over a settled multi-hop payment. AI-assisted-by: OpenAI Codex

Add channel open operations benchmark

f23517b

Add a channel-open benchmark that measures the open_channel call while leaving chain confirmation cleanup outside the timed section. AI-assisted-by: OpenAI Codex

Add seeded startup operations benchmark

1508155

Add a startup benchmark that restarts a node whose store already contains channel and payment data, so startup cost reflects persisted node state. AI-assisted-by: OpenAI Codex

Centralize PostgreSQL test URL env var

6bd3945

AI-Assisted-By: OpenAI Codex

benthecarman force-pushed the db-bench branch from c721923 to 6bd3945 Compare June 5, 2026 01:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add database persistence benchmarks#915

Add database persistence benchmarks#915
benthecarman wants to merge 25 commits into
lightningdevkit:mainfrom
benthecarman:db-bench

benthecarman commented May 31, 2026 •

edited

Loading

Uh oh!

ldk-reviews-bot commented May 31, 2026

Uh oh!

tnull commented Jun 3, 2026 •

edited

Loading

Uh oh!

benthecarman commented Jun 3, 2026 •

edited

Loading

Uh oh!

joostjager commented Jun 4, 2026

Uh oh!

tnull commented Jun 4, 2026

Uh oh!

benthecarman commented Jun 4, 2026

Uh oh!

benthecarman commented Jun 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

benthecarman commented May 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ldk-reviews-bot commented May 31, 2026

Uh oh!

tnull commented Jun 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

benthecarman commented Jun 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmark Results

Payment Benchmarks

Takeaways

Uh oh!

joostjager commented Jun 4, 2026

Uh oh!

tnull commented Jun 4, 2026

Uh oh!

benthecarman commented Jun 4, 2026

Uh oh!

benthecarman commented Jun 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

benthecarman commented May 31, 2026 •

edited

Loading

tnull commented Jun 3, 2026 •

edited

Loading

benthecarman commented Jun 3, 2026 •

edited

Loading